Convolutional Networks Used to Classify Video and Audio Data
Similar resources
Jointly Learning to Locate and Classify Words Using Convolutional Networks
In this paper, we propose a novel approach for weakly supervised word recognition. Most state-of-the-art automatic speech recognition systems are based on frame-level labels obtained through forced alignments or through a sequential loss. Recently, weakly supervised models have been proposed in vision that can learn which part of the input is relevant for classifying a given pattern [1]...
Using Convolutional Neural Networks to Classify Hate-Speech
The paper introduces a deep learning-based Twitter hate-speech text classification system. The classifier assigns each tweet to one of four predefined categories: racism, sexism, both (racism and sexism), and non-hate-speech. Four Convolutional Neural Network models were trained on, respectively, character 4-grams, word vectors based on semantic information built using word2vec, randomly generated word ve...
Spatially Encoding Temporal Correlations to Classify Temporal Data Using Convolutional Neural Networks
We propose an off-line approach to explicitly encode temporal patterns spatially as different types of images, namely, Gramian Angular Fields and Markov Transition Fields. This enables the use of techniques from computer vision for feature learning and classification. We used Tiled Convolutional Neural Networks to learn high-level features from individual GAF, MTF, and GAF-MTF images on 12 benc...
Audio Deepdream: Optimizing Raw Audio with Convolutional Networks
The hallucinatory images of DeepDream [8] opened up the floodgates for a recent wave of artwork generated by neural networks. In this work, we take first steps toward applying this to audio. We believe a key to solving this problem is training a deep neural network to perform a perception task on raw audio. Consequently, we have followed in the footsteps of Van den Oord et al. [13] and trained a net...
A Data Model for Audio-Video Data
Audio and video data have become critical components of information systems, and these systems need to efficiently manage the large storage requirements of this type of data. However, there are no formal data models for audio-video data. In this paper, we present an algebraic formalism which attempts to provide the underpinnings for the design and implementation of systems which organize and query...
Journal
Journal title: Research Papers Faculty of Materials Science and Technology Slovak University of Technology
Year: 2019
ISSN: 1338-0532
DOI: 10.2478/rput-2019-0034